
Conversation

felipemello1
Contributor

Memory freebies


I don't think that loss/reward is a good way to check correctness here, but I compared the functions locally and they produce the same output.


meta-cla bot added the CLA Signed label on Oct 7, 2025
return logprobs

# Convert to fp32 for numerical stability
scaled_logits_fp32 = scaled_logits.float()
Contributor

Noob question: what's the dtype for scaled_logits?

Contributor Author

.float() casts it to torch.float32.
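For reference, a quick illustrative check (the bf16 starting dtype is an assumption for the example, not taken from this PR; .float() always yields torch.float32):

import torch

# Illustrative only: scaled_logits would come from the model in its compute dtype.
scaled_logits = torch.randn(2, 4, dtype=torch.bfloat16)
scaled_logits_fp32 = scaled_logits.float()
print(scaled_logits_fp32.dtype)  # torch.float32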

@felipemello1
Contributor Author

@ebsmothers @Jack-Khuu @joecummings @pbontrager can some of you confirm that I don't need to do the all_gather that was happening in selective_log_softmax? Maybe it's necessary if trainer.parallelism.disable_loss_parallel=False? But this is off for all of our configs.

import torch.nn.functional as F


def selective_log_softmax(logits: torch.Tensor, index: torch.Tensor) -> torch.Tensor:
Contributor

Should we also delete this function?

Contributor Author

It's used in 3 other places, so I'll leave it there for now. We'll probably need some larger refactoring later to clean up / organize the losses.

@casteryh
Contributor

casteryh commented Oct 8, 2025

Just curious: if we torch.compile the textbook implementation, do we still need a manual optimization like selective_log_softmax?
Never mind, just remove selective_log_softmax altogether.
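For context, a minimal sketch of the two variants being discussed. The function names and the second implementation are illustrative assumptions, not the repo's exact code; the point is that the textbook version goes through the full log-softmax, while the manual version only keeps per-token values:

import torch
import torch.nn.functional as F

def textbook_logprobs(logits: torch.Tensor, index: torch.Tensor) -> torch.Tensor:
    # Textbook version: [batch, seq, vocab] log-softmax, then gather the target column.
    # torch.compile may fuse this so the full intermediate need not be written out.
    return F.log_softmax(logits, dim=-1).gather(-1, index.unsqueeze(-1)).squeeze(-1)

def manual_logprobs(logits: torch.Tensor, index: torch.Tensor) -> torch.Tensor:
    # "Selective" style: gather the target logit and subtract logsumexp,
    # avoiding the full log-softmax allocation in eager mode.
    target_logits = logits.gather(-1, index.unsqueeze(-1)).squeeze(-1)
    return target_logits - torch.logsumexp(logits, dim=-1)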


# compile loss
logger.info("Compiling loss")
self.loss = torch.compile(self.loss)
Member

Is there any circumstance under which this command would fail?

Contributor Author

Can't think of one in our scenario, but if/when it happens, we can fix it.
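For what it's worth, torch.compile itself is lazy, which is part of why a wrap-time failure is unlikely. A rough sketch of where errors would actually surface (the loss function and shapes below are made up for illustration):

import torch
import torch.nn.functional as F

def cross_entropy_loss(logits: torch.Tensor, targets: torch.Tensor) -> torch.Tensor:
    # Stand-in loss for illustration only.
    return F.cross_entropy(logits.flatten(0, 1), targets.flatten())

# Wrapping essentially never fails: no compilation happens yet.
compiled_loss = torch.compile(cross_entropy_loss)

logits = torch.randn(2, 8, 32)
targets = torch.randint(0, 32, (2, 8))
loss = compiled_loss(logits, targets)  # compilation happens on this first call;
# unsupported constructs cause graph breaks that fall back to eager, while hard
# backend errors would raise here rather than at wrap time.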

logprobs = selective_log_softmax(scaled_logits, input_ids)
return logprobs

# Cast up to fp32 for numerical stability
Member

Nit: I would change this to something like "ensure logits are in fp32", because they could already be in fp32, in which case there's no "casting up".
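Concretely, the suggestion would make that line read something like this (illustrative wording):

# Ensure logits are in fp32 for numerical stability (no-op if already fp32)
scaled_logits_fp32 = scaled_logits.float()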

felipemello1 merged commit 75815e1 into meta-pytorch:main on Oct 9, 2025
8 checks passed
felipemello1 deleted the compile_loss branch on October 9, 2025, 14:45